Rank | Count | Beginning |
---|---|---|
12208 | 1207 | كما |
24099 | 732 | وقد |
22897 | 519 | وفي |
23466 | 402 | وقال |
11299 | 373 | في |
26223 | 340 | ومن |
15448 | 336 | من |
4439 | 217 | أما |
17502 | 205 | وأضاف |
18108 | 196 | وأكد |
24923 | 170 | وكان |
16971 | 169 | و |
1370 | 166 | إذا |
13529 | 149 | لا |
23698 | 149 | وقالت |
8563 | 148 | حيث |
4782 | 144 | إن |
16577 | 143 | هل |
8325 | 136 | حضر |
10524 | 133 | على |
74 | 129 | • |
16381 | 129 | هذا |
25351 | 128 | ولا |
28829 | 128 | يجب |
18841 | 122 | وأوضح |
22596 | 121 | وعلى |
17282 | 118 | وأشار |
21430 | 114 | وذكرت |
4802 | 109 | أن |
13757 | 109 | لجنة |
The next four subsections show the most frequent sentence beginnings consisting of N words, for N = 1, 2, 3, 4. This subsection starts with N = 1.
The most frequent word N-grams at the beginning of sentences give some insight into sentence composition.
For N = 1 in particular, even a small corpus is enough to identify the most frequent sentence beginnings.
A query of the following form selects the most frequent one-word beginnings:
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
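For readers without database access, the same count can be sketched in plain Python. This is a minimal illustration, not the pipeline used to build the table: the function name `sentence_beginnings`, its parameters, and the sample sentences are all hypothetical. It assumes sentences are available as a list of strings and that words are separated by single spaces, as in the query above.

```python
from collections import Counter


def sentence_beginnings(sentences, n=1, limit=50):
    """Return the `limit` most frequent beginnings made of the first n words."""
    counts = Counter()
    for sentence in sentences:
        # Same behaviour as SUBSTRING_INDEX(sentence, ' ', n): keep everything
        # before the n-th space; a sentence with fewer than n words stays whole.
        beginning = " ".join(sentence.split(" ")[:n])
        counts[beginning] += 1
    # most_common sorts by descending count, like ORDER BY cnt DESC LIMIT 50.
    return counts.most_common(limit)


# Hypothetical toy data, not taken from the corpus.
sample = [
    "وقد أعلن الوزير ذلك",
    "وقد جاء القرار أمس",
    "كما ذكرت اللجنة",
    "في المساء",
]

for beginning, count in sentence_beginnings(sample, n=1):
    print(count, beginning)
```

Passing `n=2`, `n=3`, or `n=4` gives the longer beginnings covered in the following subsections; in the SQL query the corresponding change is the third argument of `substring_index`.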